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Description 

[0001] The present invention relates to newly identified EGD8 receptors, polynucleotides encoding this receptor, 
polypeptides encoded by such polynucleotides the preparation and the use of thereof. 

[0002] In an effort to identify new G-protein coupled receptors of the EDG (endothelial differentiation gene)-family a 
novel member of the EDG-family of G-protein coupled receptors. Human EDG8, was identified. The full-length clone 
was isolated and studies on chromosomal mapping, tissue expression and identification as afunctional cellular receptor 
for sphingosine 1-phosphatea were performed. Taken together, the data provide compelling evidence that EDG8 is 
the fifth receptor for sphingosine 1 -phosphate, exclusively expressed in peripheral tissues, its presence in endothelial 
cells being responsible for the broad tissue distribution. 

[0003] The lysolipid phosphate mediators lysophosphatidic acid (LPA) and sphingosin 1 -phosphate (S1P) have at- 
tracted Increasing attention as modulators of a variety of important biological functions (Moolenaar et al., 1997; Morris, 
1999; Lynch, 1999) and their list of biological activities is continuously growing. 

Among the biological responses to LPA is platelet aggregation (Durieux and Lynch, 1993; Moolenaar, 1994; Jalink et 
al., 1994; Siess et al., 1 999; Gueguen et al., 1 999), smooth muscle contraction, in vivo vasoactive effects, chemotaxis, 
expression of adhesion molecules on endothelial cells, increased tight junction permeability, activation of membrane 
ion channels and many others. The biochemical signalling events that mediate the cellular effects of LPA include stim- 
ulation of phospholipases, mobilization of intracellular Ca2+, inhibition of adenylyl cyclase, activation of phosphatidyli- 
nositol 3-kinase, activation of the Ras-Raf-MAP kinase cascade and stimulation of Rho-GTPases.(Moolenaar et al., 
1997) 

S1P, in particular, is implicated in cell proliferation, induction/suppression of apoptosis, modulation of cell motility, an- 
giogenesis, tumor invasiveness, platelet activation and neurite retraction. Cellular signalling by S1 P involves activation 
of PLCP and subsequent intracellular Ca2+ release, activation of MAP-kinases, activation of inward rectifying K+-chan- 
nels and inhibition and/or activation of adenylyl cyclase. 

Both, LPA and SI P are recognized to signal cells through a set of G-protein coupled receptors (GPCRs) known as 
EDG (endothelial differentiation gene)-receptors. The EDG-family of GPCRs currently comprises seven human mem- 
bers (EDG 1-7) that fall into two major groups depending on their preference for the activating lipid-ligand: EDG1 , 3, 5 
and 6 preferentially interact with S1 P, EDG2, 4 and 7 preferentially interact with LPA. 

The assignment of specific biological functions to certain receptor subtypes is hampered by the fact that EDG receptors 
are expressed in an overlapping fashion, they activate multiple and in part identical signal transduction pathways, the 
selectivity for their activating ligands is not absolute, and medicinal chemistry is only poorly developed in that specific 
antagonists for dissecting the pharmacology of the individual subtypes are not available yet. 
An important step to shed more light on the biological role of the individual receptor subtypes would be to identify the 
complete set of receptors that respond to the phospholipid mediators S1 P and LPA. 

[0004] The present invention relates to newly identified EGD8 receptors, polynucleotides encoding this receptor, 
polypeptides encoded by such polynucleotides the preparation and the use of thereof. 

[0005] The present invention relates to an isolated polynucleotide comprising a nucleotide sequence that has at least 
80 % identity to a nucleotide sequence encoding the polypeptide of SEQ ID NO. 2 or the corresponding fragment 
thereof; or a nucleotide sequence complementary to said nucleotide sequence. 
[0006] Preferably, the polynucleotide is DNA or RNA. 

Preferably, the nucleotide sequence of the polynucleotide is at least 80 % identical to that contained in SEQ ID NO. 1. 
In another embodiment, the polynucleotide has the nucleotide sequence SEQ ID NO. 1. In another embodiment, the 
polynucleotide encodes the polypeptide of SEQ ID NO. 2 or a fragment thereof. 

[0007] Another aspect to the invention relates to an expression system for the expression of EDG8. The EDG8 DNA 
or RNA molecule comprising an expression system wherein said expression system is capable of producing a polypep- 
tide or a fragment thereof having at least 80 % identity with a nucleotide sequence encoding the polypeptide of SEQ 
ID NO. 2 or said fragment when said expression system is present in a compatible host cell. 
The invention relates to a host cell comprising the expression system. 

[0008] In another aspect, the invention relates to a process for producing a EDG8 polypeptide or fragment comprising 
culturing a host cell comprising the expression system under conditions sufficient for the production of said polypeptide 
or fragment. Preferably, the said polypeptide or fragment is expressed at the surface of said cell. The invention relates 
also to cells produced by this process. 

The process preferably further includes recovering the polypeptide or fragment from the culture. 
[0009] In another aspect, the invention relates to a process for producing a cell which produces a EDG8 polypeptide 
or a fragment thereof comprising transforming or transfecting a host cell with the expression system such that the host 
cell, under appropriate culture conditions, produces a EDG8 polypeptide or fragment. 

[0010] In particular, the invention relates to an EDG8 polypeptide or a fragment thereof comprising an amino acid 
sequence which is at least 80 % identical to the amino acid sequence contained in SEQ ID NO. 2; in particular to an 
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EDG8 polypeptide or a fragment thereof having amino acid sequence SEQ ID NO. 2 or a part thereof. 

[0011] Further, the invention relates to a process for diagnosing a disease or a susceptibility to a disease related to 

expression or acitivity of EDG8 polypeptide comprising: 

5 a) determining the presence or absence of mutation in the nucleotide sequence encoding said EDG8 polypeptide 

in the genome of said subject; and/or 

b) analyzing for the presence or amount of the EDG8 polypeptide expression in a sample derived from said subject. 

[0012] In addition, the invention relates to a method for identifying compounds which bind to EDG8 polypeptide 
10 comprising: 

a) contacting a cell comprising the expression system or a part of such a cell with a candidate compound; and 

b) assessing the ability of said candidate compound to bind to said cells. 

15 [001 3] Preferably, the method further includes determining whether the candidate compound effects a signal gener- 
ated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate compound which effects 
production of said signal is identified as an agonist. 

[0014] In another embodiment of the invention, the method further includes determining whether the candidate com- 
pound effects a signal generated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate 
20 compound which effects production of said signal is identified as an antagonist. 

[0015] The invention also relates to an agonist or antagonist identified by such methods. 

[0016] In another special embodiment of the method, the mehtod further includes contacting said cell with a known 
agonist for said EDG8 polypeptide; and determining whether the signal generated by said agonist is diminished in the 
presence of said candidate compound, wherein a candidate compound which effects a diminution in said signal is 
25 identified as an antagonist for said EDG8 polypeptide. The known agonist is for example S1 P. LPA and/or DHS1 P. The 
invention also relates to an antagonist identified by the method. 

[0017] The invention in addition, relates to a method of preparing a pharmaceutical composition comprising 



a) identifying a compound which is an agonist or an antagonist of EDG8, 
30 b) preparing the compound, and 

c) optionally mixing the compound with suitable additives. 



[0018] The invention also relates to a pharmaceutical compound prepared by such a process. 

[0019] In the study, we report about the cloning, chromosomal mapping, tissue expression and functional identifica- 

35 tion as a receptor for SI P of a novel GPCR, EDG8, the fifth functional receptor for sphingosine 1 -phosphate. 

[0020] In an effort to identify new G-protein coupled receptors of the EDG-family a database search with alignments 
of the currently known 18 members of this receptor family, comprising human EDG1-7 sequences up to the putative 
EDGs from Xenopus and Zebra-fish was performed. A multiple alignment of these sequences was created by CLUS- 
TALW and used in a PSI-BLAST search to scan translated versions of human genomic DNA sequences, which were 

^0 publicly available in the different EMBL sections. For translation of DNA into protein sequences, individual protein files 
within two respective STOP-codons were created and all proteins shorter than 50 amino acids were ignored. As the 
majority of GPCRs is unspliced searching for GPCRs within genomic sequences should bring about novel receptor 
proteins. 

Performing a PSI-BLAST search, the various cDNAs and genomic contigs, respectively, for the human EDG1 -7 recep- 
45 tors were identified, and an additional genomic hit with a high e-value, that was not identical to any of the published 
EDG-sequences. The nucleotide and amino acid sequence of the new putative GPCR are depicted in Fig.1 A. Hydrop- 
athy analysis (hydrophobicity plot not shown) suggests a seven transmembrane protein with three alternating extra- 
and intracellular loops, assumed to be the heptahelix structure common to GPCRs. 

To shed more, light on the relationships involved in the molecular evolution of the EDG-receptor family, a grow tree 
50 phylogram was constructed using the neighbor joining method (GCG software) (Fig.1 B) According to this phylogenetic 
tree, the human EDG-family can be divided into two distinct groups: EDG1, 3, 5 and 6 belonging to one, EDG2, 4 and 
7 belonging to the other group. These two groups are discriminated further by their preference for different lipid ligands: 
EDG1 , 3, 5, 6 are preferentially stimulated by sphingosin 1 -phosphate (S1 P), EDG2, 4 and 7 by lysophosphatidic acid 
(LPA). The newly identified sequence was named EDGS for reasons of consistency with the existing human EDG- 
55 family nomenclature and exhibited highest similarity (86.8%) to the rat nrg1 -protein (Fig. 1 B), a GPCR recently cloned 
by EST-expression profiling from a rat PC12 cell library (Glickman et al., 1 999). The high similarity between EDG8 and 
the known sphingosin 1 -phosphate (S1P) receptors EDG1, 3 and 5 (48-51%) (Fig. 1C) led us to test the hypothesis 
that EDGS may be a functional S1P-receptor. 
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In testing for S1 P receptor activity, the EDG8 cDNA was introduced into Chinese hamster ovary (CHO) cells by transient 
transfection. CHO cells were chosen as they exhibit minimal responses to sphingosin 1 -phosphate in concentrations 
up to 1 |iM but respond to S1P after transfection with the SIP preferring receptors EDG 1, 3 and 5 (Okamoto et al., 
1998; Kon et al., 1999). To test functional receptor activity, it was decided to monitor the mobilization of intracellular 
5 Ca2+ for three reasons: 

1 . ) SI P has been reported to increase Ca2+ in many cell types; 

2. ) the identification of EDG1, 3, 5 and 6 as receptors for SIP has provided the molecular basis for a GPCR 
mediated mechanism and the receptors are known to mediate intracellular Ca2+ through either PTX-sensttive Gi 

^0 proteins or the PTX-insensitive Gaq/11 pathway; 

3. ) all currently known S1 P-responding EDG-receptors (except EDG6) are present in endothelial cells (A. Nieder- 
nberg et al., submitted), in which intracellular Ca2+ release is a major pathway in the generation of NO, an important 
factor in vascular biology. 

15 Thus, identification of the complete set of SI P receptors, involved in intracellular Ca2+ mobilization could help clarify 
the role of the individual subtypes in endothelial cell signalling. 

[0021] Fig.2 depicts measurement of the intracellular Ca2+ concentration, mediated by SI P via the different SI P- 
receptors EDG1 , 3, 5, 6 and the putative SI P-receptor EDG8, cotransfected in CHO cells with empty vector DNA as 
a control or various G-protein a subunits. Intracellular Ca2+ concentrations were recorded as real time measurements 

20 using the Fluorescence plate imaging reader (FLIPR, Molecular Devices). As has been reported for EDG1, 3 and 5, 
SI P elicited intracellular Ca2+ signals that did not require cotransfection of a G-protein a subunit. As already known 
for a large number of Gq-coupled receptors, coexpression of Gaq augments the EDG1 and 5-mediated Ca2+-release 
as compared with the Ca2+ signal induced by stimulation of endogeneous Gaq. In case of EDG3, additional exoge- 
neously added Gaq did not further improve the signal intensity. In case of EDG6, Yamazaki Y. et al. (2000) reported 

25 an SI P-induced mobilization of intracellular Ca2+ but we failed to detect a significant Ca2+ increase above basal levels 
in the absence of any cotransfected G-protein a subunit. The reason for this discrepancy could be the cellular back- 
ground, as Yamazaki Y. et al. (2000) reported that the Ca2+ signal can be completely abolished in the presence of 
pertussis toxin (PTX), indicating the involvement of Gi-type G-proteins. In this case the Ca2+ signal would be elicited 
by bg, released from activated Gaibg heterotrimers. The Gai-induced Ca2+ signals are known to be much smaller in 

30 intensity as compared with the Ca2+ signals induced by bona-fide Gq-linked receptors (Kostenis et al., 1 997a; Kostenis 
et al., 1997b). It may be that detection of such Ca2+ concentrations is beyond the sensitivity of the FLIPR system. 
EDG8 did not release intracellular Ca2+ when stimulated with S1 P (Fig.2), but gained the ability to mobilize Ca2+ upon 
cotransfection with Gal 6, a G-protein a-subunit, known to couple GPCRs from different functional classes to the Gq- 
PLCP pathway or GaqiS, a mutant G-protein a subunit that confers onto Gi-ltked receptors the ability to stimulate Gq. 

35 These results show that EDG8 as opposed to EDG 3 and 5 is not a bona-fide Gq-coupled receptor but a fuctional 
cellular receptor for SIP. To check, whether the EDG8 receptor also reacts to related lysophospholipid mediators, we 
examined the abilities of lysophosphatidic acid (LPA), dihydrosphingosin 1 -phosphate (DHS1P), sphingosylphospho- 
rylcholine (SPC) and lysophosphatidylcholine (LPC) to increase intracellular Ca2+ in CHO cells transiently transfected 
with the EDG8 receptor and the G-protein a subunits G16 and Gqi5 (Fig.3). Besides SI P, which was the most potent 

40 activator of EDG8, LPA and DHS1 P evoked [Ca2+]i increases in concentrations of 100 and 1000 nM. SPC and LPC, 
respectively, failed to generate any significant response in concentrations up to 1 ^M.. These data show that EDG8 Is 
a SIP preferring receptor, but also responds to related phospholipids like DHS1P or LPA, as has also been reported 
for EDG1 , which is a high affinity receptor for SI P and a low affinity receptor for LPA. 

[0022] Next, the expression pattern of the human EDG8 gene in human tissues was investigated by Northern blot 
45 analysis (Flg.4). Northern blot analysis shows EDG8 expression in several peripheral tissues. Tissues positive for 
EDG8 RNA were skeletal muscle, heart and kidney, lower abundance of RNA was seen in liver and placenta, no signal 
was obtained in brain, thymus, spleen, lung and peripheral blood leukocytes. In ail tissues a single RNA transcript of 
5.5 kb was observed after hybridization with a DIG-labelled EDG8 antisense RNA probe. EDG8 exhibits highest sim- 
ilarity to the rat nrg1 -GPCR with an amino acid identity of 86.8% (Fig.1 B) suggesting that it may be the human homolog 
50 of the rat nrgi protein. However, the expression pattern of human EDG8 is quite different from the rat nrg1 -receptor, 
which is found almost exclusively in brain. This finding suggests that EDG8 may represent a closely related but entirely 
different receptor from nrgi, rather than the human homolog. Never the less, it does not rule out the possibility that 
EDG8 and nrgi are homologs with entirely different, species-dependent expression patterns. 
[0023] As the first member of the EDG-family of GPCRs was cloned as an immediate early gene induced during the 
55 morphogenetic differentiation phase of angiogenesis (HIa and Maciag, 1990) and subsequently cloned from a human 
umbilical vein endothelial cell library exposed to fluid shear stress as an upregulated gene it is reasonable to assume 
that EDG receptors play an important role in the regulation of endothelial function. Therefore, the presence of EDG8 
transcripts in several human endothelial cell lines was analyzed. RT-PCR analysis of human umbilival vein endothelial 
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cells (HUVECs), human coronary artery endothelial cells (HCAECs), human microvascular endothelial cells of the lung 
(HMVEC-L) and human pulmonary artery endothelial cells (HPAEC) revealed EDG8 expression in all cell lines tested 
(Fig.SA). In Fig.SB it is shown that EDG8 specific primers indeed solely amplify EDG8 sequences and none of the 
related EDG1-7 sequences. These findings suggest that the presence of EDG8 in different peripheral organs may be 
5 due to its localization in endothelial cells; it does not rule out, however, that EDG8 transcripts occur in cell types other 
than endothelial cells. 

[0024] The expression of EDG8 in addition to EDG1, 3, and 5 (Rizza at al., 1999) in HUVECS and several other 
endothelial cell lines is intriguing in view of all the reports regarding S1 P effects on endothelial cell signalling. Hisano 
et al. (1 999) reported that SI P protects HUVECS from apoptosis induced by withdrawal of growth factors and stimulates 

10 HUVEC DNA synthesis; the authors derived a model for cell-cell interactions between endothelial cells and platelets 
but the S1 P-receptor responsible for HUVEC-protection of apoptosis could not be identified. Rizza et al., 1 999 reported 
that S1 P plays a role in endothelial cell leukocyte interaction in that S1 P induces expression of cell adhesion molecules 
in human aortic endothelial cells, allowing monocytes and neutrophils to attach. These effects were blocked by pertussis 
toxin, suggesting the involvement of a Gi-coupled S1P receptor. The responsible SI P-receptor subtype, however, 

15 could not be identified and the EDG8 receptor was not included at the time of this study. Expression profiling of all 
EDG receptors in individual celt lines and the use of EDG receptor subtype selective compounds will clearly be nec- 
essary to help determine the role of the individual S1 P receptors in endothelial cell signalling mechanisms. 
[0025] Finally, the mapping of EDG receptors in genomic sequences allowed to allocate a map position for four genes 
of this family (Tab.1). Interestingly, so far, four EDG-receptors including EDG8 are located on chromosome 19. 

20 In conclusion, we isolated a new member of the EDG-family of G-protein coupled receptors, EDG8, and showed that 
it functions as a cellular receptor for sphingosine 1 -phosphate. EDG8 could exclusively be detected in peripheral tissues 
like skeletal muscle, heart and kidney and several human endothelial cell lines. It is conceivable that the expression 
in endothelial cells may account for the broad tissue distribution of this receptor. The existence of at least eight EDG- 
receptors for lysophospholipids suggests that receptor subtype selective agonists and antagonists will essentially be 

25 necessary for a better understanding of the biology of lysophospholipids and their respective receptors. 

Figure legends 

[0026] Fig.lA: The nucleotide and deduced amino acid sequence of human EDG8. The deduced amino acid se- 
30 quence is shown below the nucleotide sequence with the nucleotide positions indicated on the left. The putative seven 
transmembrane domains are underlined. 

[0027] Fig. 1 B: Phylogenetic tree of the EDG-family of receptors. The phylogenetic tree depicted was derived by the 
neighbor joining method method performed with the GCG program. 

[0028] Fig. 1C: Alignment of the amino acid sequence of human EDG8 with the other EDG-family members. The 
35 amino acid sequence of EDG8 is compared with the EDG1-7 polypeptides (EDG1 : accession number M 31 210, EDG2: 
accession number U 8081 1 , EDG3: accession number X 83864, EDG4: accession number af Oil 466, EDGS: accession 
number af 034780, EDG7: accession number af 127138). The approximate boundaries of the seven putative trans- 
membrane domains are boxed. Gaps are introduced to optimize the alignment. 

[0029] Fig.2: Mobilization of intracellular Ca2+ by S1P mediated by the EDG1, 3, 5 and 8 receptor in CHO cells, 
^0 cotransfected with empty vector DNA as a control or the Indicated G-protein a subunits. Agonist-mediated changes of 
intracellular Ca2+ were measured with the FLIPR using the Ca2+-sensitive dye FLU04. Fluorescence of transfected 
cells loaded with FLU04 was recorded before and after addition of SI P, applied in the indicated concentrations. Data 
are expressed as means of quadruplicate determinations in a single experiment. An additional experiment gave similar 
results. 

45 [0030] Fig.3: Effects of S1 P, LPA and related lysophospholipid mediators on EDG8-mediated increase in intracellular 
Ca2+ measured with the FLIPR as described in the legend of Fig.4. The different lipids were applied in concentrations 
of 1 0, 1 00 and 1 000 nM, respectively. Data are means of quadruplicate determinations of a representative experiment. 
Two additional experiments gave similar results. 

[0031] Fig.4: Northern blot analysis of EDG8 in human tissues. Poly(A)+ RNA (l^ig) from various human tissues 
50 (human multiple tissue Northern blots, CLONTECH) was hybridized with probes specific to human EDGS (upper panel) 
and glyceraldehyd-3-phosphate dehydrogenase, GAPDH, (lower panel) on a nylon membrane. The origin of each RNA 
is indicated at the top, the molecular mass of standard markers in kilobases (kb) is shown on the left. 
[0032] Fig.SA: Reverse transcriptase-polymerase chain reaction (RT-PCR) analysis of EDG8 in different human en- 
dothelial cell lines (HUVECS; human umbilical vein endothelial cells; HCAEC: human coronary artery endothelial cells; 
55 HMVEC-L: human microvascular endothelial cells from lung; HPAEC: human pulmonary artery endothelial cells). 
EDG8-specific transcripts were detected in all endothelial cell lines. Agarose gel electrophoresis of the PCR products 
after 35 cycles of amplification is shown. Amplification with EDG8-specific primers yields a 522 bp EDG8-fragment as 
indicated by the arrow. 
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[0033] Fig.SB: PGR analysis of EDG8 primers for specificity of amplification of EDG8 sequences. Primers, specific 
for the EDG8 sequence, were checked for potential amplification of the related EDG1 -7 sequences. The EDG8 specific 
522 bp band occurred only when EDG8 was used as a template. 



Table 1 : 



Chromosomal localization of EDG-receptors 1-8. 

Mapping of EDG receptors to genomic sequences allowed to derive a chromosomal assignment for EDG4, 5, 6 and 
8. The chromosomal localization of EDG1-3 was obtained from the genecards datacollection, 


EDG 


Chromosomal localisation 


according BAC AccNr: 


EDG1 


1p21.1-21.3 


AL161741 


EDG2 


9q31.1-32//18p11-3 


AL1 57881 //AP000882 


EDG3 


9q22.1-q22.2 




EDG4 


19p12 


NT_000939 


EDG5 


19 


AC011511 


EDG6 


19p13.3 


AC011547 


EDG7 


1p22.3-31.2 


AL1 39822 


EDG8 


19 


AC011461 



Examples 



Example 1 : Molecular cloning of the human EDG8 receptor. 

[0034] As the putative human EDG8 sequence is intronless, we cloned the receptor from human genomic DNA 
(CLONTECH, Palo Alto, CA, 94303-4230) via polymerase chain reaction (PCR). PCR conditions, estabHshed to amplify 
the EDG8 sequence were 94°C, 1 min followed by 35 cycles of 94°C, 30sec, 68C, 3 min, using GC-Melt Kit (CLON- 
TECH, Palo Alto, CA). Primers designed to amplify the EDG8 sequence contained a Hindll! site in the fonward, and a 
EcoRI site in the reverse primer, respectively. The 11 97 bp PCR product was cloned into the pCDNA3.1 (+) mammalian 
expression vector (Invitrogen, Carlsbad, California) and sequenced in both directions. 

Example 2: Cell culture and Transfection. 

[0035] CHO-K1 cells were grown in basal ISCOVE medium supplemented with 1 0% fetal bovine serum at 37°C in 
a humidified 5% C02 incubator. For transfections, 2 x 1 0^ cells were seeded into 35-mm dishes. About 24 hr later cells 
were transiently transfected at 50-80% confluency with the indicated receptor and G-protein constructs (l^ig of plasmid 
DNA each) using the Lipofectamine transfection reagent and the supplied protocol (GIBCO). 1 8-24 hr after transfection 
celts were seeded into 96well plates at a density of 50.000 cells per well and cultured for 1 8-24 additional hr until used 
in the functional FLIPR assays. 

The cDNA for Gal 6 was cloned from TF1 cells by RT-PCR and ligated into the pCDNAI.1 mammalian expression 
vector (Invitrogen). Murine wild type Gaq was cloned from cells by RT-PCR and inserted into the SamHI-A/s/l-sites of 
pCDNAI .1 . To create the C-terminally modified Gaqjs subunit, in which the last five aa of wt Gaq were replaced with 
the correspoding Gaj sequence, a 175-bp BgA\'Nsi\ fragment was replaced, in a two piece ligation, with a synthetic 
DNA fragment, containing the desired codon changes. The correctness of all PCR-derived sequences was verified by 
sequencing in both directions. 

Example 3: Fluorometric Imaging Plate Reader (FLIPR) Assay. 

[0036] Twenty-four hours after transfection, cells were splitted into 96-well, black-wall microptates (Corning) at a 
density of 50,000 cells per well. 1 8-24 hr later, cells were loaded with 95^1 of HBSS containing 20 mM Hepes, 2.5 mM 
probenecid, 4 p,M fluorescent calcium indicator dye Fluo4 (Molecular Probes) and 1 % fetal bovine serum for 1 h(37°C, 
5% CO2). Cells were washed three times with HBSS containing 20 mM Hepes and 2.5 mM probenecid in a cell washer. 
After the final wash, the solution was aspirated to a residual volume of 100 |xl per 96 well. Lipid ligands were dissolved 
in DMSO as 2 mM stock solutions (treated with ultrasound when necessary) and diluted at least 1:100 into HBSS 
containing 20 mM HEPES, 2.5 mM probenecid and 0.4 mg/ml fatty acid free bovine serum albumine. Lipids were 
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aliquoted as 2X solutions into a 96 well plate prior to the assay. The fluorometric imaging plate reader (FLIPR, Molecular 
Devices) was programmed to transfer 100 \i\ from each well of the ligand microplate to each well of the cellplate and 
to record fluorescence during 3 min in 1 second intervals during the first minute and 3 second intervals during the last 
two minutes. Total fluorescence counts from the 18-s to 37-s time points are used to determine agonist activity. The 
5 instrument software normalizes the fluorescent reading to give equivalent initial readings at time zero. 

Example 4: Northern Blot analysis. 

[0037] Human multiple tissue Northern blots were purchased from CLONTECH (Palo Alto. CA, 94303-4230, USA) 
10 antisense RNA probes were generated by subcloning nucleotides 279-1197 of the coding region into the Bam Hl-Eco 
Rl sites of the expression vector PSPT1 8 (Roche Diagnostics, Mannheim, Germany) and subsequent random priming 
with a DIG-RNA Labeling kit (Roche Diagnostics, Mannheim, Germany), using T7 RNA polymerase. Hybridization was 
carried out at 68°C for 16 h in hybridization buffer (Dig Easy Hyb Roche Diagnostics, Mannheim, Germany). Each blot 
was washed , blocked and detected as indicated in the standard protocol with the DIG Wash and Block Buffer set 
15 (Roche Diagnostics, Mannheim, Germany) and treated with 1 ml CSPD ready-to-use(Roche Diagnostics, Mannheim, 
Germany) for 15 min, 37°C and developed for 5 min on the Lumiimager (Roche). Finally, each blot was stripped (50 
% formamid,5% SDS, 50 mM Tris/HCI pH 7,5 ; 80° C, 2x 1 hour) and rehybridized with a GAPDH antisense RNA probe 
as an internal standard. 

20 Example 5: RNA Extraction and RT-PCR. 

[0038] RNA was prepared from different endothelial cell lines (HUVECS, HCAEC, HMVEC-L, HPAEC) using the 
TRIzol reagent (Hersteller, Lok.). Briefly, for each endothelial cell line, cells of a subconfluent 25 cm2 tissue culture 
flask were collected In 2,5ml TRIzol and total RNAs were extracted according to the supplied protocol. The purity of 
25 the RNA preparation was checked by veryfying the absence of genomic DNA. An aliquot of RNA, corresponding to 
-'5^g, was used for the cDNA generation using MMLV reverse transcriptase and the RT-PCR kit from STRATAGENE. 
RT-PCR was carried out in a volume of 50 n1, the RT-PCR conditions were set to 65°C for 5 min, 15min at RT, 1 hour 
at 37°C, 5 min at 90°C, chill on ice. 

The cDNA templates for the PCR reactions (35 cycles of 94°C for 30 sec, 68°C for 3 min) were the reverse transcribed 
30 products of RNAs isolated from human endothelial cell lines (HUVECS.HCAEC, HMVEC-L, HPAEC). Typically, 1 -5 ^^ 
of reverse transcribed cDNAs were used as templates for the PCR reactions. 

Example 6: Sources of materials. 

35 [0039] 1-oleoyl-LPA, sphingosin 1 -phosphate (S1P), dihydrosphingosin 1 -phosphate (DHS1P), lysophosphatidyl- 
choline (LPC), sphingosylphosphorylcholine (SPC) and fatty acid free BSA were from SIGMA (P.O.Box 14508, St. 
Louis, Missouri 63178). CHO-K1 cells were obtained from the American Type culture collection (ATCC, Manassas, 
Virginia), cell culture media and sera from GIBCO BRL (Gaithersburg, MD), the Ca fluorescent dye FLU04 and pluronic 
acid from Molecular devices (Sunnyvale CA 94089-1 1 36,USA) human northern blot membrane from CLONTECH (1 020 

40 East Meadow Circle, Palo Alto, California 94303-4230, USA.), commercially available cDNAs (heart, fetal heart, left 
atrium, left ventricle, kidney, brain, liver, lung, aorta) from Invitrogen, oligonucleotides from MWG-Biotech AG (Ebers- 
berg, Germany), the RT-PCR kit from SIGMA, the GC-melt PCR kit from Clontech (Palo Alto, CA), the expression 
plasmid pcDNA3.1 for EDG8 and pCDNAI.1 for expression of G-protein a subunits from Invitrogen (Carlsbad, CA 
92008), competent DH5a from GIBCO and MC 1063 from Invitrogen. 

45 
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List of non-standard abbreviations: 

[0041] S1 P, sphingosine 1 -pliosphate; LPA, lysophosphatidic acid; dHS1 P, dihydro sphingosine 1 -phosphate; SPC, 
sphingosylphosphorylcholine; LPC, lysophosphatidylcholine; GPCR, G-protein-coupled receptor; G-protein, guanine 
nucleotide-binding protein; [Ca2+]j, intracellular Calcium concentration, RT-PCR, reverse transcription polymerase 
chain reaction; bp, base pair; ORF, open reading frame; EST, expressed sequence tag; FAF-BSA, fatty acid free bovine 
serum albumine; HUVECS. Human umbilical vein endothelial cells; HCAEC, human coronary artery endothelial cells; 
HMVEC-L, human microvascular endothelial cells from lung; HPAEC, human pulmonary artery endothelial cells. 

Table 2: 

SEQ ID NO. 1: Nucleotide sequence of human EDG8 

1 ATGGAGTCGGGGCTGCTGCGGCCGGCGCCGGTGAGCGAGGTCATCGTCCTGCATTACAAC 
61 TACACCGGCAAGCTCCGCGGTGCGCGCTACCAGCCGGGTGCCGGCCTGCGCGCCGACGCC 
121 GTGGTGTGCCTGGCGGTGTGCGCCTTCATCGTGCTAGAGAATCTAGCCGTGTTGTTGGTG . 
181 CTCGGACGCCACCCGCGCTTCCACGCTCCCATGTTCCTGCTCCTGGGCAGCCTCACGTTG 
241 TCGGATCTGCTGGCAGGCGCCGCCTACGCCGCCAACATCCTACTGTCGGGGCCGCTCACG 
301 CTGAAACTGTCCCCCGCGCTCTGGTTCGCACGGGAGGGAGGCGTCTTCGTGGCACTCACT 
3 61 GCGTCCGTGCTGAGCCTCCTGGCCATCGCGCTGGAGCGCAGCCTCACCATGGCGCGCAGG 
421 GGGCCCGCGCCCGTCTCCAGTCGGGGGCGCACGCTGGCGATGGCAGCCGCGGCCTGGGGC 
481 GTGTCGCTGCTCCTCGGGCTCCTGCCAGCGCTGGGCTGGAATTGCCTGGGTCGCCTGGAC 
541 GCTTGCTCCACTGTCTTGCCGCTCTACGCCAAGGCCTACGTGCTCTTCTGCGTGCTCGCC 



601 TTCGTGGGCATCCTGGCCGCTATCTGTGCACTCTACGCGCGCATCTACTGCCAGGTACGC 

661 GCCAACGCGCGGCGCCTGCCGGCACGGCCCGGGACTGCGGGG ACC ACCTCGACCCGGGCG 

721 CGTCGCAAGCCGCGCTCGCTGGCCTTGCTGCGCACGCTCAGCGTGGTGCTCCTGGCCTTT 

781 GTGGCATGTTGGGGCCCCCTCTTCCTGCTGCTGTTGCTCGACGTGGCGTGCCCGGCGCGC 

841 ACCTGTCCTGTACTCCTGCAGGCCGATCCCTTCCTGGGACTGGCCATGGCCAACTCACTT 

901 CTGAACCCCATCATCTACACGCTCACCAACCGCGACCTGCGCCACGCGCTCCTGCGCCTG 

961 GTCTGCTGCGGACGCCACTCCTGCGGCAGAGACCCGAGTGGCTCCCAGCAGTCGGCGAGC 

1021 GCGGCTGAGGCTTCCGGGGGCCTGCGCCGCTGCCTGCCCCCGGGCCTTGATGGGAGCTTC 

1081 AGCGGCTCGGAGCGCTCATCGCCCCAGCGCGACGGGCTGGACACCAGCGGCTCCACAGGC 

1141 AGCCCCGGTGCACCCACAGCCGCCCGGACTCTGGTATCAGAACCGGCTGCAGACTGA 
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Table 3: 

SEQ ID NO. 2: Amino acid sequence of human EDG8 

MESGLLRPAPVSEVIVLHYN 
YTGKLRGAR -YQPGAGLRADA 
VVC. LAVCAFIVLEMLAVLLV 
LGRHPRFHAPMFLLLGSLTL 
SDLLAGAAYAANILLSGPLT 
LKLS PALWFAREGGV FVA L T 
ASVLSLLAIALERSLTMA RR 
GPAPVSSRGRTLAMAAAAWG 
VSLLLGLLPALGWNCLGRLD 
ACSTVLPLYAKAYVLFCVLA 
FVGILAAICALYARIYCQVR 
ANARRLPARPGTAGTTSTRA 
RRKPRSLALLRTLSVVLLAF 
VACWGPLFLLLLLDVACPAR 
35 TCPVLLQADPFLGLAMANSL 



10 



15 



20 



25 



30 



40 



45 



50 



LNPIIYTLTNRDLRHALLRL 
VCCGRHSCGRDPSGSQQSAS 
AAEASGGLRRCLPPGLDGSF 
SGSERSSPQRDGLDTSGSTG 
SPGAPTAARTLVSEPAAD* 



55 
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Annex to the application documents - subsequently filed sequences linsting 
[0042] 

SEQUENCE LISTING 

<110> Aventis Pharma Deutschland GmbH 

<120> EDG8 receptor, its preparation and use 

<130> D-2000/A024 

<140> 00108858.2 
<14i> 2000-04-26 

<160> 2 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 1197 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atggagtcgg ggctgctgcg gccggcgccg 

tacaccggca agctccgcgg tgcgcgctac 

gtggtgtgcc tggcggtgtg cgccttcatc 

ctcggacgcc acccgcgctt ccacgctccc 

tcggatctgc tggcaggcgc cgcctacgcc 

ctgaaactgt cccccgcgct ctggttcgca 

gcgtccgtgc tgagcctcct ggccatcgcg 

gggcccgcgc ccgtctccag tcgggggcgc 

gtgtcgctgc tcctcgggct cctgccagcg 

gcttgctcca ctgtcttgcc gctctacgcc 

ttcgtgggca tcctggccgc tatctgtgca 

gccaacgcgc ggcgcctgcc ggcacggccc 

cgtcgcaagc cgcgctcgct ggccttgctg 

gtggcatgtt ggggccccct cttcctgctg 

acctgtcctg tactcctgca ggccgatccc 

ctgaacccca tcatctacac gctcaccaac 

gtctgctgcg gacgccactc ctgcggcaga 

gcggctgagg cttccggggg cctgcgccgc 

agcggctcgg agcgctcatc gccccagcgc 

agccccggtg cacccacagc cgcccggact 

<210> 2 
<211> 398 
<212> PRT 
<213> Homo sapiens 



gtgagcgagg 


tcatcgtcct 


gcattacaac 


60 


cagccgggtg 


ccggcctgcg 


cgccgacgcc 


120 


gtgctagaga 


atctagccgt 


gttgttggtg 


180 


atgttcctgc 


tcctgggcag 


cctcacgttg 


240 


gccaacatcc 


tactgtcggg 


gccgctcacg 


300 


cgggagggag 


gcgtcttcgt 


ggcactcact 


360 


ctggagcgca 


gcctcaccat 


ggcgcgcagg 


420 


acgctggcga 


tggcagccgc 


ggcctggggc 


480 


ctgggctgga 


attgcctggg 


tcgcctggac 


540 


aaggcctacg 


tgctcttctg 


cgtgctcgcc 


600 


ctctacgcgc 


gcatctactg 


ccaggtacgc 


660 


gggactgcgg 


ggaccacctc 


gacccgggcg 


720 


cgcacgctca 


gcgtggtgct 


cctggccttt 


780 


ctgttgctcg 


acgtggcgtg 


cccggcgcgc 


840 


ttcctgggac 


tggccatggc 


caactcactt 


900 


cgcgacctgc 


gccacgcgct 


cctgcgcctg 


960 


gacccgagtg 


gctcccagca 


gtcggcgagc 


1020 


tgcctgcccc 


cgggccttga 


tgggagcttc 


1080 


gacgggctgg 


acaccagcgg 


ctccacaggc 


1140 


ctggtatcag 


aaccggctgc 


agactga 


1197 
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<400> 2 

Met Glu Ser Gly Leu Leu Arg Pro Ala Pro Val Ser GXu Val He Val 
15 10 15 

Leu His Tyr Asn Tyr Thr Gly Lys Leu Arg Gly Ala Arg Tyr Gin Pro 
20 25 30 



10 



Gly Ala Gly Leu Arg Ala Asp Ala Val Val Cys Leu Ala Val Cys Ala 
35 40 45 



15 



Phe He Val Leu Glu Asn Leu Ala Val Leu Leu Val Leu Gly Arg His 
50 55 60 

Pro Arg Phe His Ala Pro Met Phe Leu Leu Leu Gly Ser Leu Thr Leu 
65 70 75 80 



20 



Ser Asp Leu Leu Ala Gly Ala Ala Tyr Ala Ala Asn He Leu Leu Ser 
85 90 95 



25 



Gly Pro Leu Thr Leu Lys Leu Ser Pro Ala Leu Trp Phe Ala Arg Glu 
100 105 110 



Gly Gly Val Phe Val Ala Leu Thr Ala Ser Val Leu Ser Leu Leu Ala 
115 120 125 



30 



He Ala Leu Glu Arg Ser Leu Thr Met Ala Arg Arg Gly Pro Ala Pro 
130 135 140 



35 



Val Ser Ser Arg Gly Arg Thr Leu Ala Met Ala Ala Ala Ala Trp Gly 
145 150 155 160 



Val Ser Leu Leu Leu Gly Leu Leu Pro Ala Leu Gly Trp Asn Cys Leu 
165 170 175 



40 



Gly Arg Leu Asp Ala Cys Ser Thr Val Leu Pro Leu Tyr Ala Lys Ala 
180 185 190 



45 



Tyr Val Leu Phe Cys Val Leu Ala Phe Val Gly He Leu Ala Ala He 
195 200 205 



Cys Ala Leu Tyr Ala Arg He Tyr Cys Gin Val Arg Ala Asn Ala Arg 
210 215 220 



50 



Arg Leu Pro Ala Arg Pro Gly Thr Ala Gly Thr Thr Ser Thr Arg Ala 
225 230 235 240 



55 



Arg Arg Lys Pro Arg Ser Leu Ala Leu Leu Arg Thr Leu Ser Val Val 
245 250 255 
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Leu Leu Ala Phe Val Ala Cys Trp Gly Pro Leu Phe Leu Leu Leu Leu 
260 265 270 

Leu Asp Val Ala Cys Pro Ala Arg Thr Cys Pro Val Leu Leu Gin Ala 
275 280 285 

Asp Pro Phe Leu Gly Leu Ala Met Ala Asn Ser Leu Leu Asn Pro lie 
290 295 300 

lie Tyr Thr Leu Thr Asn Arg Asp Leu Arg His Ala Leu Leu Arg Leu 
305 310 315 320 

Val Cys Cys Gly Arg His Ser Cys Gly Arg Asp Pro Ser Gly Ser Gin 
325 330 335 

Gin Ser Ala Ser Ala Ala Glu Ala Ser Gly Gly Leu Arg Arg Cys Leu 
340 345 350 

Pro Pro Gly Leu Asp Gly Ser Phe Ser Gly Ser Glu Arg Ser Ser Pro 
355 360 365 

Gin Arg Asp Gly Leu Asp Thr Ser Gly Ser Thr Gly Ser Pro Gly Ala 
370 375 380 

Pro Thr Ala Ala Arg Thr Leu Val Ser Glu Pro Ala Ala Asp 
385 390 395 



Claims 

1 . An isolated polynucleotide comprising a nucleotide sequence that has at least 80 % identity to a nucleotide se- 
quence encoding the polypeptide of SEQ ID NO. 2 or the corresponding fragment thereof; or a nucleotide sequence 
complementary to said nucleotide sequence. 

2. The polynucleotide of claim 1 which is DNA or RNA. 

3. The polynucleotide of claim 1 wherein said nucleotide sequence is at least 80 % identical to that contained in SEQ 
IDNO. 1. 

4. The polynucleotide of claim 3 wherein said nucleotide sequence is contained in SEQ ID NO. 1 . 

5. The polynucleotide with sequence SEQ ID NO. 1 . 

6. The polynucleotide of claim 1 wherein said encoding nucleotide sequence encodes the polypeptide of SEQ ID 
NO. 2 or a fragment thereof. 

7. EDG8 DNA or RNA molecule comprising an expression system wherein said expression system is capable of 
producing a polypeptide or a fragment thereof having at least 80 % identity with a nucleotide sequence encoding 
the polypeptide of SEQ ID NO. 2 or said fragment when said expression system is present in a compatible host cell. 
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8. A host cell comprising the expression system of claim 7. 

9. A process for producing a EDG8 polypeptide or fragment comprising culturing a host claim 8 and under conditions 
sufficient for the production of said polypeptide or fragment. 

5 

10. The process of claim 9 wherein said polypeptide or fragment is expressed at the surface of said cell. 

11. Cells produced by the process of claim 10. 

10 12. The process of claim 9 which further includes recovering the polypeptide or fragment from the culture. 

1 3, A process for producing a cell which produces a EDG8 polypeptide or a fragment thereof comprising transforming 
or transfecting a host cell with the expression system of claim 7 such that the host cell, under appropriate culture 
conditions, produces a EDG8 polypeptide or fragment. 

15 

14. EDG8 polypeptide or a fragment thereof comprising an amino acid sequence which is at least 80 % identical to 
the amino acid sequence contained in SEQ ID NO. 2. 



15. Polypeptide of claim 14 which comprises the amino acid sequence of SEQ ID NO. 2, or a fragment thereof. 

20 

16. EDG8 Polypeptide or fragment prepared by the method of claim 12. 

17. A process for diagnosing a disease or a susceptibility to a disease related to expression or acitivity of EDG8 
polypeptide comprising: 

25 

c) determining the presence or absence of mutation in the nucleotide sequence encoding said EDG8 polypep- 
tide in the genome of said subject; and/or 

d) analyzing for the presence or amount of the EDG8 polypeptide expression in a sample derived from said 
subject. 

30 

18. A method for identifying compounds which bind to EDG8 polypeptide comprising: 

c) contacting a cell as claimed in claim 11 or a part thereof with a candidate compound; and 

d) assessing the ability of said candidate compound to bind to said cells. 

35 , 

19. The method of claim 18 which further includes determining whether the candidate compound effects a signal 
generated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate compound which 
effects production of said signal is identified as an agonist. 



40 20. The method of claim 18 which further Includes determining whether the candidate compound effects a signal 
generated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate compound which 
effects production of said signal is identified as an antagonist. 

21. An agonist identified by the method of claim 19. 

45 

22. An antagonist identified by the method of claim 20. 

23. The method of claim 1 8 which further includes contacting said cell with a known agonist for said EDG8 polypeptide; 
and determining whether the signal generated by said agonist is diminished in the presence of said candidate 

50 compound, wherein a candidate compound which effects a diminution in said signal is identified as an antagonist 

for said EDG8 polypeptide. 

24. A method as claimed in claim 23, wherein the known agonist is Sip, LPA and/or DHS1 P. 
55 25, An antagonist identified by the method of claim 23 or 24. 

26. Method of preparing a pharmaceutical composition comprising 
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a) identifying a compound which is an agonist or an antagonist of EDG8, 

b) preparing the compound, and 

c) optionally mixing the compound with suitable additives. 

Pharmaceutical compound prepared by a process of claim 26. 
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1A: 

1 ATGGAGTCGGGGCTGCTGCGGCCGGCGCCGGTGAGCGAGGTCATCGTCCTGCATTACAAC 
MESGLIiRPAPVSEVIVLHYN 

6 1 TACACCGGCAAGCTCCGCGGTGCGCGCT ACCAGCCGGGTGCCGGCCTGCGCGCCG ACGCC 
YTGKLRGARYQPGAGLRADA 

121 GTGGTGTGCCTGGCGGTGTGCGCCTTCATCGTGCTAGAGAATCTAGCCGTGTTGTTGGTG 
VVCLAVCAFIVLENLAVLLV 

181 CTCGGACGCCACCCGCGCTTCCACGCTCCCATGTTCCTGCTCCTGGGCAGCCTCACGTTG 
LGRHPRFHAPMFLLLGSLTL 

241 TCGGATCTGCTGGCAGGCGCCGCCTACGCCGCCAACATCCTACTGTCGGGGCCGCTCACG 
SDLLAGAAYAANILLSGPLT 

301 CTGAAACTGTCCCCCGCGCTCTGGTTCGCACGGGAGGGAGGCGTCTTCGTGGCACTCACT 
LKLSPALWFAREGGVFVALT 

361 GCGTCCGTGCTGAGCCTCCTGGCCATCGCGCTGGAGCGCAGCCTCACCATGGCGCGCAGG 
ASVLSLLAXALERSLTMARR 

4 21 GGGCCCGCGCCCGTCTCCAGTCGGGGGCGCACGCTGGCGATGGCAGCCGCGGCCTGGGGC 
GPAPVSSRGRTLAMAAAAWG 

4 81 GTGTCGCTGCTCCTCGGGCTCCTGCCAGCGCTGGGCTGGAATTGCCTGGGTCGCCTGGAC 
VSLLLGLLPALGWNCLGRLD 

541 GCTTGCTCCACTGTCTTGCCGCTCTACGCCAAGGCCTACGTGCTCTTCTGCGTGCTCGCC 
ACSTVLPLYAKAYVLFCVLA 

601 TTCGTGGGCATCCTGGCCGCTATCTGTGCACTCTACGCGCGCATCTACTGCCAGGTACGC 
FVGILAAICALYARIYCQVR 

661 GCCAACGCGCGGCGCCTGCCGGCACGGCCCGGGACTGCGGGGACCACCTCGACCCGGGCG 
ANARRLPARPGTAGTTSTRA 

7 21 CGTCGCAAGCCGCGCTCGCTGGCCTTGCTGCGCACGCTCAGCGTGGTGCTCCTGGCCTTT 
RRKPRSLALLRTLSVVLLAF 

781 GTGGCATGTTGGGGCCCCCTCTTCCTGCTGCTGTTGCTCGACGTGGCGTGCCCGGCGCGC 
VACWGPLFLLLLLDVACPAR 

841 ACCTGTCCTGTACTCCTGCAGGCCGATCCCTTCCTGGGACTGGCCATGGCCAACTCACTT 
TCPVLLQADPFLGLAMANSL 

901 CTGAACCCCATCATCTACACGCTCACCAACCGCGACCTGCGCCACGCGCTCCTGCGCCTG 
LNPI lYTLTNRDLRHALLRL 

961 GTCTGCTGCGGACGCCACTCCTGCGGCAGAGACCCGAGTGGCTCCCAGCAGTCGGCGAGC 
VCCGRHSCGRDPSGSQQSAS 

1021 GCGGCTGAGGCTTCCGGGGGCCTGCGCCGCTGCCTGCCCCCGGGCCTTGATGGGAGCTTC 
AAEASGGLRRCLPPGLDGSF 

1081 AGCGGCTCGGAGCGCTCATCGCCCCAGCGCGACGGGCTGGACACCAGCGGCTCCACAGGC 
SGSERSSPQRDGL DTSGSTG 

1141 AGCCCCGGTGCACCCACAGCCGCCCGGACTCTGGTATCAGAACCGGCTGCAGACTGA 
SPGAPTAARTLVSEPAAD* 
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Fig. 1B 

[ edg2_sheep 
' edg2_bovin 
I edg2_mouse 
r edg_xenopus 

[ j edg7_human 

1 edg4_human 

p edg6_raouse 
|_ edg6_human 
r nrgl_rat 
L edg8_human 
r edg5_rat 

edg5_human 
I edgl_rat 

edgl_raouse 
L edgl_humaii 
I— edg_zebraf ish 
I — edg3_human 
I — edg3_fugu 

edg2_human 
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FIG 1C: 



1 60 

edg2_human MAAISTSIPV ISOPQFTAMN EPQCFYNESI AFFYNRSGKH LAT.EWNTVS KLVMGt. .GI 

eclg7_human MN E. .CHYDKHM DFFYNRSNTD TVD.DW.TGT KLVIVLCVGT 

edg4_huinan ™MVI MGQCYYNETI GFFYNNSGKE LSS.HWR..P KDVVVVALGL 

edgl__human MGPTS VPLVKAHRSS VSDYVNYDII VRHYNYTGKL . .NISADKEN SIKLTSVVFI 

edg3_huinan --MATALPPR LQPVRGNSTL REHYQYVGKL AGRLKEASEG S.TLTTVLFL 

edg5_human MGSL YSEYLNPNKV QEHYNYTKE. . .TLETQETT SRQVASAFIV 

edg8_human MESGL LRPAPVSEVI VLHYNVTGKL RG.ARYQPGA GLRADAWCL 

edg6_human MNATG TPVAPESCQQ LAAGGHSRLI VLHYNHSGRL AGR.GGPEDG GLGALRGLSV 



edg2_ 

edg7_ 

edg4_ 

edgl 

edg3" 

edg5_ 

edge_ 

edg6_ 



human 
human 
human 
human 
human 
human 
human 
human 



61 

TVCIFIMLAN 
FFCLFIFFSN 
TVSVLVLLTN 
LICCFIILEN 
VICSFIVLEN 
ILCCAIVVEN 
AVCAFIVLEN 
AASCLVVLEN 



LLVMVAIYVN 
SLVIAAVIKN 
LLVIAAIASN 
IFVLLTIWKT 
LMVLIAIWKN 
LLVLIAVARN 
LAVLLVLGRH 
LLVLAAITSH 



RRFHFPIYYL 
RKFHFPFYYL 
RRFHQPIYYL 
KKFHRPMYYF 
NKFHNRMYFF 
SKFHSAMYLF 
PRFHAPMFLL 
MRSRRWVYYC 



KANLAAADFF 
LANLAAADFF 
LGNLAAADLF 
IGNLALSDLL 
IGNLALCDLL 
LGNLAASDLL 
LGSLTLSDLL 
LVNITLSDLL 



AGLAYFYLMF 
AGIAYVFLMF 
AGVAYLFLMF 
AGVAYTANLL 
AGIAYKVNIL 
AGVAFVANTL 
AGAAYAANIL 
TGAAYLANVL 



120 

NTGPNTRRLT 
NTGPVSKTLT 
HTGPRTARLS 
LSGATTYKLT 
MSGKKTFSLS 
LSGSVTLRLT 
LSGPLTLKLS 
LSGARTFRLA 



edg2_ 
edg7_ 
edg4 
edgl" 
edg3" 
edg5_ 
edgS] 
edg6_ 



human 
human 
human 
human 
human 
human 
human 
human 



121 

VSTWLLRQGL 
VNRWFLRQGL 
LEGWFLRQGL 
PAQWFLREGS 
PTVWFLREGS 
PVQWFAREGS 
PALWFAREGG 
PAQWFLREGL 



IDTSLTASVA 
LDSSLTASLT 
LDTSLTASVA 
MFVALSASVF 
MFVALGASTC 
ASITLSASVF 
VFVALTASVL 
LFTALAASTF 



NLLAIAIERH 
NLLVIAVERH 
TLLAIAVERH 
SLLAIAIERY 
SLLAIAXERH 
SLLAIAIERH 
SLLAIALERS 
SLLFTAGERF 



ITVFR.MQLH 
MSIMR.MRVH 
RSVMA.VQLH 
ITMLK.MKLH 
LTMIK.MRPY 
VAIAK-.VKLY. 
LTMAR.RGPA 
ATMVRPVAES 



TRMSNRRWV 
SNLTKKRVTL 
SRLPRGRWM 
NGSNNFRLFL 
DANKRHRVFL 
GSDKSCRMLL 
PVSSRGRTLA 
GATKTSRVYG 



180 

VIVVIWTMAI 
LILLVWAIAI 
LIVGVWVAAL 
LISACWVISL 
LIGMCWLIAF 
LIGASWLISL 
MAAAAWGVSL 
FIGLCWLLAA 



edg2_ 
edg?" 
edg4 
edgf 
edg3] 
edg5_ 
edg8_ 
edg6_ 



human 
human 
human 
human 
human 
human 
human 
human 



181 

VMGAIPSVGW 
FMGAVPTLGW 
GLGLLPAHSW 
ILGGLPIMGW 
TLGALPILGW 
VLGGLPILGW 
LLGLLPALGW 
LLGMLPLLGW 



NCICDIENCS 
NCLCNISACS 
HCLCALDRCS 
NCISALSSCS 
NCLHNLPDCS 
NCLGHLEACS 
NCLGRLDACS 
NCLCAFDRCS 



NMAPLYSDSY 
SLAPIYSRSY 
RMAPLLSRSY 
TVLPLYHKHY 
TILPLYSKKY 
TVLPLYAKHY 
TVLPLYAKAY 
SLLPLYSKRY 



LVFWAIFNLV 
LVFWTVSNLM 
LAVWALSSLL 
ILFCTTVFTL 
lAFCISIFTA 
VLCVVTIFSI 
VLFCVLAFVG 
ILFCLVIFAG 



TFWMVVLYA 
AFLIMWVYL 
VFLLMVAVYT 
LLLSIVILYC 
ILVTIVILYA 
ILLAIVALYV 
ILAAICALYA 
VLATIMGLYG 



240 

HIFGYVRQRT 
RIYVYVKRKT 
RIFFYVRRRV 
RIYSLVRTRS 
RIYFLVKSSS 
RIYCVVRSSH 
RIYCQVRANA 
AI FRLVQASG 



241 300 

edg2_human MRMSRHSSGP R RNR DTMMSLLKTV VIVLGAFIIC WTPGLVLLLL D.VCCP..QC 

edg7_human NVLSPHTSGS I SRR RTPMKLMKTV MTVLGAFVVC WTPGLWLLL DGLNCR. .QC 

edg4~human QRMAEHVSCH P RYR ETTLSLVKTV VIILGAFVVC WTPGQWLLL DGLGCE..SC 

edgl_human RRLTFR KNISKASRS SENVALLKTV IIVLSVFIAC WAPLFILLLL DV . GCKVKTC 

edg3_human RKVANH NN S ERSMALLRTV VIVVSVFIAC WSPLFILFLI DV.ACRVQAC 

edgS human ADMA A PQTLALLKTV TIVLGVFIVC WLPAFSILLL DY.ACPVHSC 

edg8~human RRLPARPGTA GTTSTRARRK PRSLALLRTL SWLLAFVACWGPLFLLLLL DV.ACPARTC 

edg6_human QKAP RPAARRK ARR..LLKTV LMILLAFLVC WGPLFGLLLA DVFGSNLWAQ 

301 360 

edg2_human DVLAYEKFFL LLAEFNSAMN PIIYSYRDKE MSATFRQILC CQRSENPTGP TESSDRSASS 

edg7~human GVQHVKRWFL LLALLNSVVN PIIYSYKDED MYGTMKKMIC CFSQENP ERRPSR 

edg4_human NVLAVEKYFL LLAEANSLVN AAVYSCRDAE MRRTFRRLLC CACLRQSTRE SVHYTSSAQG 

edgl_human DILFRAEYFL VLAVLNSGTN PIIYTLTNKE MRRAFIRIMS CCKCPSGD S 

edg3_human PILFKAQWFI VLAVLNSAMN PVXYTLASKE MRRAFFRLV. .CNC.LVR G 

edg5 human PXLYKAHYFF AVSTLNSLLN PVIYTWRSRD LRREVLRPLQ CWRPGVGV Q 

edgS'human PVLLQADPFL GLAMANSLLN PIIYTLTNRD LRHALLRLVC CGRHSCGRDP SGS..QQSAS 

edg6~human EYLRGMDWIL ALAVLNSAVN PIIYSFRSRE VCRAVLSFLC CGCLRLGMRG PGDCLARAVE 



361 418 

edg2_human LNHTILAGVH SNDHSW 

edg7_human IPSTVLSRSD TGSQYIEDSI SQGAVCNKST S 

edg4_human GASTRIMLPE NGHPLMTPPF SYLELQRYAA SNKSTAPDDL WVLLAQPNQQ D 

edgl~haman AGKFKRPIIA GMEFSRSK.. .SDNSSHPQK DEGDNPETIM SSGNVNSSS- - — 

edg3_human RGARASPIQP ALDPSRSKSS SSNNSSHSPK VKEDLPHTOP SSCIMDKNAA LQNGIFCN 

edg5_human GRRRVGTPGH HLLPLRSSSS LERGMHMPTS PTFLEGNTW — 

edg8_human AAEASGGLRR CLPPGLDGSF SGSERSSPQR DGLDTSGSTG SPGAPTAART LVSEPAAD 

edg6_human AHSGASTTDS SLRP.RDSFR GSRSLSFRMR EPLSSISSVR SI 
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